Skip to content

Conversation

CISC
Copy link
Collaborator

@CISC CISC commented Jul 27, 2025

Adds softcap fusion (scale->tanh->scale).

Minor refactoring to ggml_cuda_can_fuse to handle unary ops.

@CISC CISC requested a review from JohannesGaessler July 27, 2025 19:45
@github-actions github-actions bot added testing Everything test related Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Jul 27, 2025
@CISC CISC requested a review from JohannesGaessler July 28, 2025 20:36
@CISC CISC merged commit 138b288 into master Jul 29, 2025
86 of 88 checks passed
@CISC CISC deleted the cisc/cuda-fuse-softcap branch July 29, 2025 12:22
Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Aug 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants